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^C) ' Abstract 

We show that a randomly chosen 3-CNF formula over n variables with clauses-to- variables 
ratio at least 4.4898 is, as n grows large, asymptotically almost surely unsatisfiable. The 
previous best such bound, due to Dubois in 1999, was 4.506. The first such bound, inde- 
pendently discovered by many groups of researchers since 1983, was 5.19. Several decreasing 
values between 5.19 and 4.506 were published in the years between. The probabilistic tech- 
niques we use for the proof are, we believe, of independent interest. 

1 Introduction 
> 

O 
O 
\Q ■ Satisfiability of Boolean formulas is a problem universally believed to be hard. Determining the 

source of this hardness will lead, as is often stressed, to applications in domains even outside 

the realm of mathematics or computer science; moreover, and perhaps more importantly, it will 

enhance our understanding of the foundations of computing. 

In the beginning of the 90's several groups of experimentalists chose to examine the source 
of this hardness from the following viewpoint: consider a random 3-CNF formula with a given 
clauses-to- variables ratio, which is known as the density of the formula. What is the probability 
of it being satisfied and how does this probability depend on the density? Their simulation 
results led to the conclusion that if the density is fixed and below a number approximately equal 
to 4.25, then for large n, a randomly chosen formula is almost always satisfiable, whereas if 
the density is fixed and above 4.25, a randomly chosen formula is, for large n, almost always 
unsatisfiable. More importantly, around 4.25 the complexity of several well known complete 
algorithms for checking satisfiability reaches a steep peak (see e.g. [10, 15, 16]). So, in a certain 
sense, 4.25 is the point where from an empirical, statistical viewpoint, the "hard" instances of 
SAT are to be found. Similar results were obtained for other combinatorial problems, and also 
for fc-SAT for values of k > 3. 

These experimental results were followed by an intense activity to provide "rigorous results" 
(the expression often used in this context to refer to theorems). Perhaps the most important 
advance is due to Friedgut: in [7] he proved that there is a sequence of reals (c n ) n such that 
for any e > the probability of a randomly chosen 3-CNF-formula with density c n — e being 
satisfiable approaches 1 (as n — ► oo), whereas for density c n + e, it approaches 0. Intuitively, 
this means that the transition from satisfiability to unsatisfiability is sharp, however it is still 
not known if (c n ) n converges. 
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Despite the fact that the convergence of (c n ) n is still an open problem, increasingly improved 
upper and lower bounds on its terms have been computed in a rigorous way by many groups of 
researchers. The currently best lower bound is 3.52 [9, 2]. 

With respect to upper bounds, which is the subject of this work, the progress was slower but 
better, in the sense that the experimentally established threshold is more closely bounded from 
above, rather than from below. A naive application of the first moment method yields an upper 
bound of 5.191 (see e.g., [6]). An important advance was made in [8], where the upper bound 
was improved to 4.76. In the sequel, the work of several groups of researchers, based on more 
refined variants of the first moment method, culminated in the value of 4.571 [4, 11] (see the nice 
surveys [12, 3] for a complete sequence of the events). The core idea in these works was to use the 
first moment method by computing the expected number of not all satisfying truth assignments, 
but only of those among them that are local maxima in the sense of a lexicographic ordering, 
within a degree of locality determined by the Hamming distance between truth assignments 
(considered as binary sequences). For degree of locality 1, this amounts to computing the 
expected number of satisfying assignments that become unsatisfying assignments by flipping any 
of their "false" values (value 0) to "true" (value 1). Such assignments are sometimes referred to 
as single-flip satisfying assignments. 

The next big step was taken by Dubois et al. [5], who showed that 4.506 is an upper bound. 
Instead of considering further variations of satisfiability, they limited the domain of computa- 
tions to formulas that have a typical syntactic characteristic. Namely, they considered formulas 
where the cardinality of variables with given numbers of occurrences as positive and negative 
literals, respectively, approaches a two dimensional Poisson distribution. Asymptotically almost 
all formulas have this typical property (we say that such formulas have a Poisson 2D degree 
sequence). It turns out that the expectation of the number of single- flip satisfying assignments 
is exponentially reduced when computed for such formulas. To get the afore mentioned upper 
bound, Dubois et al. further limited the domain of computations to formulas that are positively 
unbalanced, i.e. formulas where every variable has at least as many occurrences as a positive 
literal as it has as a negated one. 

A completely different direction was recently taken in [13]. Their work was motivated by 
results on the geometry of satisfying assignments, and especially the way they form clusters 
(components where one can move from one satisfying assignment to another by hops of small 
Hamming distance). Most of these results were originally based on analytical, but non-rigorous, 
techniques of Statistical Physics; lately however important rigorous advances were made [1, 14]. 
The value of the upper bound obtained by Maneva and Sinclair (see [13]) was 4.453, far below 
any other upper bound presently known (including the one in this paper). However it was proved 
assuming a conjecture on the geometry of the satisfying truth assignments which is presently 
proved only for fe-SAT for k > 8 in [1]. 

In this paper, we show that 4.4898 is an upper bound. Our approach builds upon previous 
work. It makes use (i) of single-flip satisfying truth assignments, (ii) of formulas with a Poisson 
2D degree sequence and (iii) of positively unbalanced formulas. 

We add to these previously known techniques two novel elements that further reduce the 
expectation computed. Our approach is rigorous: although we make use of computer programs, 
the outputs we use are formally justified. What is interesting is not that much the numerical 
value we get, although it constitutes a further improvement to a long series of results. The main 
interest lies, we believe, on one hand in the new techniques themselves and on the other in the 
fact that putting together so many disparate techniques necessitates a delicately balanced proof 
structure. 

First, we start by recursively eliminating one- by-one the occurrences of pure literals from the 
random formula, until we get its impure core, i.e. the largest sub-formula with no pure literals (a 
pure literal is one that has at least one occurrence in the formula but whose negation has none). 
Obviously this elimination has no effect on the satisfiability of the formula. Since we consider 



random formulas with a given 2D degree sequence, we first have to determine what is the 2D 
degree sequence of the impure core. For this, we use the differential equation method [18]. The 
setting of the differential equations is more conveniently carried out in the so called configuration 
model, where the random formula is constructed by starting with as many labelled copies of each 
literal as its occurrences and then by considering random 3D matchings of these copies. The 
matchings define the clauses. The change of models from the standard one to the configuration 
model with a Poisson 2D degree sequence is formalized in Lemma 2.1. We also take care of 
the fact that the configuration model allows formulas with (i) multiple clauses and (ii) multiple 
occurrences of the same variable in a clause, whereas we are interested in simple formulas, i.e. 
formulas where neither (i) nor (ii) holds. For our purposes, it is enough to bound from below 
the probability of getting a simple formula in the configuration model by e~ ' n gn ', see 
Lemma 2.2. The differential equations are then analytically solved, and we thus obtain the 2D 
degree sequence of the core, see Lemma 2.3. 

Second, we require that not only the 2D degree sequence is Poisson, but also that the numbers 
of clauses with none, one, two and three positive literals, respectively, are close to the expected 
numbers. Notice that these expected numbers have to reflect the fact that we consider positively 
unbalanced formulas. This is formalized in Lemma 2.4. 

The expectation of the number of satisfying assignments, in the framework determined by 
all the restrictions above, is computed in Lemma 2.5. This expectation turns out to be given by 
a sum of polynomially many terms of functions that are exponential in n. We estimate this sum 
by its maximum term, using a standard technique. However in this case, finding the maximum 
term entails maximizing a function of many variables whose number depends on n. To avoid 
a maximization that depends on n we prove a truncation result which allows us to consider 
formulas that have a Poisson 2D degree sequence only for light variables, i.e. variables whose 
number of occurrences, either as positive or negated literals, is at most a constant independent 
of n. 

Then we carry out the maximization. The technique we use is the standard one by Lagrange 
multipliers. We get a complex 3x3 system which can be solved numerically. We formally prove 
that the system does not maximize on the boundary of the system and we make a sweep over 
the domain which confirms the results of the numerical solution. 

Due to lack of space, the proofs of some lemmata are deferred to the appendix of the paper. 

As usual asymptotically almost surely (a.a.s.) will mean with probability tending to 1 as 
n — > oo. All asymptotic expressions as 1 — oil) are always with respect to n. Our main result 
in the paper is the following: 

Theorem 1.1. Let 7 = 4.4898 and m = [771]. A random 3-CNF formula in J- n) m (i-c with n 
variables and m clauses, no repetition of clauses and no repetition of variables in a clause) is 
not satisfiable a.a.s. 

2 Background and Technical highlights. 

Given a set of n Boolean variables, and let m = \_^fn\ . Let J- n ,m be the set of 3-CNF formulas 
with n variables and m clauses, where repetition of clauses or repetition of variables in a clause 
is not allowed. We also denote by T nm the probability space of formulas in J- n , m drawn with 
uniform probability. Throughout the paper, we fix the value 7 = 4.4898 and prove for that value 
a random 3-CNF formula is not satisfiable with high probability. 

Throughout the paper, scaled will always mean divided by n, and a scaled natural will be a 
member of -N. Given formula <p £ J~ n ,m, we define the following parameters which depend on 
(f>: For any i,j € N, let dij be the scaled number of variables with i positive occurrences and j 
negative occurrences in <f>. Then, 

E ^ = L (^ 



The sequence d = (di,j)i,jsw is called the degree sequence of (p. The scaled number of clauses of 
(f) is denoted by c, and can be expressed by 

i,jeN 

Note that if € J- nm , then c must additionally satisfy c = [ r yn\/n. 

Given e\ > and any sequence £ = (6,i,j)i,jeN of nonnegative reals with Ylijen£i,j = 1> 
define 

A/"(n, £, ei) = |d = (djj)ij eN : J^ djj = 1, | J^ (* + jK,j G N > v *>i e N ^jn G N, 

and |djj — £jj| < e±, and if z > n ' or j > n ' then dij = >. 

Intuitively A/"(n, £, ei) can be interpreted as the set of degree sequences d which are close to the 
ideal sequence £, which in general is not a degree sequence since its entries £jj need not be scaled 
naturals. However, if n is large enough, then A/"(n,£,ei) 7^ 0. Now we consider the 2D Poisson 
ideal sequence S defined by dij = e _37 (37 / /2)* +: '/(i!j!). The following lemma reflects the fact 
that almost all 4> € T n ^ m have a degree sequence d which is close to d. A proof of an analogous 
result can be found in [5] (the main difference is that we have to apply Markov's inequality in 
the end to show that there are no variables of degree n 1 ' 6 or larger). 

Lemma 2.1. Let d be the degree sequence of a random <\> G T n ,m- For any e\ > 0, we have that 
Pr^ m (deU(n,S,e 1 )) = l-o(l). 

Given a fixed degree sequence d = (dij)ij^n satisfying (1) and such that c = c(d) defined 
by (2) is also a scaled natural, we wish to generate 3-CNF formulas with that particular degree 
sequence d. A natural approach to this is to use the configuration model. A configuration Lp 
with degree sequence d = (djj)jj 6 N is constructed as follows: consider n variables and the 
corresponding 2n literals xi, x\ . . . , x n , x n ; each literal has a certain number of distinct labelled 
copies in a way that the scaled number of variables with i positive copies and j negative copies 
is dij; then partition the set of copies into sets of size 3, which we the call clauses of (p. Let 
C n ,d be the set of all configurations with degree sequence d, and we also denote by C nt d the 
probability space on the set C n ^ with the uniform distribution. 

A 3-CNF multi-formula is a formula with possible repetition of variables in one clause and/or 
possible repetition of clauses. A simple formula is a formula in T n ^ m - Let tt be the projection 
from C nt d to 3-CNF multi-formulas obtained by unlabelling the copies of each literal. A con- 
figuration ip E C n a is satisfiable if <f> = 7r(</?) is satisfiable. A configuration 99 G C n> d is simple 
iff (j) = ir((p) is a simple formula, i.e. does not have repetition of variables or clauses. Notice 
that the number of anti-images of a simple formula <p with degree sequence d under n does not 
depend on the particular choice of 4>. Hence, 

Pr ^n,m(^ is Sat | d) = Pr Cnjd (^ is Sat | Simple). (3) 

We need a lower bound on the probability that a configuration is simple. The following 
result gives a weak bound which is enough for our purposes. 

Lemma 2.2. Let e\ > and d <G J\f(n, S, e\). Then 

Pr Cnd (Simple) > e -©(™ 1/3 i°s™), 

where the e _e ( n logn ) bound is uniform for all d G Af(n, S, ei). 



Given <p £ C Uj d, a pure variable of ip is a variable which has a non-zero number of occurrences 
which are either all syntactically positive or all syntactically negative. The only literal occurring 
in p and all its copies are also called pure. If p is satisfiable and a; is a pure variable of (p, then 
there exists some satisfying truth assignment of (p which satisfies all copies of x in p. Hence, in 
order to study the satisfiability of a p £ C n> d, we can satisfy each pure variable in p and remove 
all clauses containing a copy of that variable. For each cp G C nt d, let p be the configuration 
obtained by greedily removing all pure variables and their corresponding clauses from p. This p 
is independent of the particular elimination order of pure literals and is called the impure core of 
p. In fact, in our analysis we will eliminate only one clause containing one copy of a pure literal 
at a time (the p obtained still remains the same). Note that p is satisfiable iff p> is satisfiable. 
Moreover, if p is simple then <p is also simple (but the converse is not necessarily true). 

Furthermore, let <p be the configuration obtained from p by positively unbalancing all vari- 
ables, i.e. switching the syntactic sign of those variables having initially more negative than 
positive occurrences in p. Let C n ^ denote the probability space of configurations p>, where <p 
was chosen from C n ^d with uniform probability. Note that the probability distribution in C n) d 
is not necessarily uniform. Since the simplicity and the satisfiability of a configuration are not 
affected by positively unbalancing the variables, we have 

Pr Cn d (p is Sat A Simple) < Prs- (ip is Sat A Simple). (4) 



n .d 



Let the random variable d be the degree sequence of a random configuration in C n> d- We prove 
in the following result that if the original d is close to the ideal sequence S, then with high 
probability d must be close to the ideal sequence S = (#ij)ijeN defined by 



S i,j 



0, if i < j, 



where b = (1 — tvll)' l an d tx> is the scaled number of steps in the pure literal elimination 
algorithm. 

Proposition 2.3. Given €2 > 0, there exists e± > and < (3 < 1 such that for any d € 
A/"(n,(5,ei) 

Pr^ (deM(n,8,e 2 ))=l-0(p nl/2 ). 

Moreover, for each d E Af(n,d,€2), the probability space C rij d conditional upon having degree 
sequence d has the uniform distribution (i.e. C rit d conditional upon a fixed d behaves exactly as 

n,d' 

Let d G M(n, S, €2). Then, each p G C ^ has a scaled number of clauses of c = c(d) (see (2)). 
Moreover, let £ p and £ n be the scaled number of copies in p of positive and of negative literals 
respectively. Then 

Given any fixed p> € C 2 an d for k £ {0, . . . , 3}, let Cfc be the scaled number of clauses in p 
containing exactly k positive copies (clauses of syntactic type k). We call c = (cq, . . . ,03) the 
clause-type sequence of p. By definition 

ci + 2c 2 + 3c 3 = £ p , 3c + 2ci + c 2 = £ n , (6) 

and by adding the equations in (6), cq + • • • + 63 = c . The cq, . . . , 63 are random variables in 
C n 2, but the next result shows that if d is close enough to 6, then cq, • • • , S3 as well as their sum 



Co + • • • + C3 = c are concentrated with high probability. In order to see this, we need to define 
7 = c(S), A p = £ p (d) and A n = £ n (S) (see (2) and (5)), which can be interpreted as the limit of 
c, £ p and £ n respectively when d approaches S. In terms of these numbers, we thus define for all 

ke{0,...,3} 

~ (A A p fc A n 3 - fc ^ 

7fe= U(A P TA^ 7 (7) 

and also 7 = (70, • • • ,%)■ Then we have 71 + 272 + 373 = A p , 3% + 2% + 72 = A n and 

7o + 7i + 72 + 73 = 7- 

The next result shows that when d is close enough to S, then each Ck is close to the corre- 
sponding %. Indeed, given e > and for any d G J\f(n, 6, £2), let C e ~ be the set of all (^GC n ^ 

such that for k G {0, . . . ,3}, \ck — %| < e. We also denote by C e ~ the corresponding uniform 

probability space. 

Lemma 2.4. Given e > 0, there is €2 > and < (3 < 1 suc/i that for any d G Af(n, S, €2), 

All the previous lemmata establish a connection between the uniform probability spaces 
J- n) m and C e ~. In order to prove Theorem 1.1, it remains to bound the probability that a 
configuration ip G C e ~ is simple and satisfiable, as it is done in the following result. 

Lemma 2.5. There exists e > and < /3 < 1 smc/i i/iat /or any d G AT(n, 5, e), 

Pr^ „(Sat A Simple) = 0((3 n ). 

The proof of Lemma 2.5 is given in Section 3 below. Finally, we can complete the proof of 
our main result. 

Proof of Theorem 1.1. Choose e, £2,ei > and < /3 < 1 which satisfy the statements of 
Lemmata 2.3, 2.4 and 2.5. We can assume that ei < €2 < e. Let A/i = M(n,S,e2) C M(n,S,e). 
Using Lemmata 2.4 and 2.5, for any d G A/i 

Pr c - (Sat A Simple) < Pr^ _ (Sat A Simple) + 0(/3 n ) = 0(/T). (8) 

Let A/2 = A/ r (n, 5, ei), and consider any given d G A/2. Then from (4), (8) and Lemma 2.3, 
Pr Cn d (SAT A Simple) < Prg- (Sat A Simple) 

= Pr C„, d ( 2 ^)+ E Prc ni3 (SATASlMPLE)Pr c ^(d) 

deNi 
< 0((3 nl/2 ) + 0(/3")Pr 4d (d G A/i) = 0(/3" V2 ), (9) 

where the events (d) and (d G A/i) on the probability space C„ jC z denote that the degree sequence 
is d and the degree sequence is in Ai- Finally, from (3), Lemma 2.1, Lemma 2.2 and (9) we get 

Pr^„, m (SAT) = Pr^ m (d <£M 2 )+J2 Pr CM( SAT I SiMPLE)Pr^ im (d) 

deM 2 
m ■ y^ Pr ^. d (Sat A Simple) 
= ° (1) + 2J Prc - (Simple) Pr ^" (d) 

< o( l )+e e(n 1 /3logn) J2 Pr Cn , d (SATASlMPLE)Pr^ m (d) 
deAT 2 

< (i) +e e(« 1/3l °s«)0(/3" 1/2 )Pr^ m (d G A2) = o(l), 

where (d) and (d G A/2) denote respectively the event that a random formula G .T^m has 
degree sequence d and the event that the degree sequence of <\> belongs to A/2. □ 



3 Proof of Lemma 2.5 

Let A/" (re, S, 7, e) be the set of tuples (d,c) such that d G Af(n, S, e) and c = (ck)o<k<3 is a tuple 
of scaled naturals satisfying (6) (recall also from (5) the definition of £ p and £ n ), and moreover 
|cfc — %| < e. For each (d, c) £ A/"(n, d, 7, e), we define C n ^- to be the uniform probability space 

of all configurations with degree sequence d and clause-type sequence c. In order to prove the 
lemma, it suffices to show that for any (d,c) £ J\f(n, <5,7,e) we have 

Pr Cn3e (Sat A Simple) = 0(/? n ). 

Hence, we consider d, c and the probability space C n d g to be fixed throughout this section, and 
we try to find a suitable bound for Pr(SAT A Simple). 

We need some definitions. Let us fix any given configuration <p G C -i~. A light variable of ip 
is a variable with i < M positive occurrences and j < M negative occurrences in ip (we will use 
here and in the numerical calculations that follow the value M = 23). The other variables are 
called heavy. We consider a weaker notion of satisfiability in which heavy variables are treated 
as jokers and are always satisfied regardless of their sign in the formula and their assigned value. 
Given a configuration ip E C ,~ an d a truth assignment A, we say that A \= tp iff each clause of 
ip contains at least one heavy variable or at least one satisfied occurrence of a light variable. Let 
Sat be the set of configurations w£C o~ for which there exists at least one truth assignment 

r n,d,c ° 

A such that A \=^ ip. Clearly, if A \= p, then also A |= b ip, and hence Sat C SAT b . We 
still introduce a further restriction to satisfiability in a way similar to [11] and [4], in order to 
decrease the number of satisfying truth assignments of each configuration without altering the 
set of satisfiable configurations (at least without alterating this set for simple configurations). 
Given a configuration p € C o- and a truth assignment A, we say that A \= ip iff A \= <p 
and moreover each light variable which is assigned the value zero by A appears at least once 
as the only satisfied literal of a blocking clause (i.e. a clause with one satisfied negative literal 
and two unsatisfied ones). Let Sat be the set of configurations which are satisfiable according 
to this latter notion. Notice that if p £ Simple, then p e Sat iff ip € Sat (by an argument 
analogous to the one in [11] and [4]). Therefore, we have 

Pr(SAT A Simple) < Pr(SAT b A Simple) = Pr(SAT b/ A Simple) < Pr(SAT b/ ). 

Let X be the random variable counting the number of satisfying truth assignments of a randomly 
chosen configuration ip E C- n 2,c m *^ e ^ AT ' sense - We need to bound 



\{(p,A) : <p€_C nA£ , Ap>}| 

\C 



Pr(SAT w ) = Pr(X > 0) < EX = "'"'" . (10) 



In the following subsection, we obtain an exact but complicated expression for EX by a counting 
argument, and then we give a simple asymptotic bound which depends on the maximization of 
a particular continuous function over a bounded polytope. The next subsection contains the 
maximization of that function. 

3.1 Asymptotic bound on EX 

First, we compute the denominator of the rightmost member in (10). 

re \ / £ p n \ ( £ n n \ (3cbn)! {2c\ri)\ (2c2n)! (Scsn)\ 



1 ra,a,< 



{di tj n)ijj Vein, 2c 2 n, 3c 3 ?V V3c n, 2cin, c 2 rey (c n)!6 c ° n 2^ n 2 c ^ n (c 3 re)!6 c 3 r 
re! (£ p n)!(£ n n)! 



ni J (^j«)! 2 " n3( " 0+ " 3)n Po«)Kcin)!(c 2 re)!(c 3 n)! 



In order to deal with the numerator in (10), we need some definitions. Let us consider any fixed 
(p G C 2~ and any assignment A such that A \= 92. We will classify the variables, the clauses 
and the copies of literals in (p into several types, and define parameters counting the scaled 
number of items of each type. Variables are classified according to their degree. A variable is 
said to have degree (i,j) if it appears i times positively and j times negatively in (p. Let £ and 
TC, respectively, be the set of possible degrees for light and heavy variables. 

£ = {(i,j) e N 2 : < i,j < M}, H = {(i,j) G N 2 : i > M or j > M}. 

We also consider an extended notion of degree for light variables which are assigned by A. 
One of such variables has extended degree (i,j, k) if it has degree (i,j) and among its j negative 
occurrences k appear in a blocking clause (being the only satisfied literal of the clause). Let 

£' = {(ij } k) G N 3 : < % < M, l<k<j< M}, 

be the set of possible extended degrees for these light 0- variables. For each (i,j) G £, let tij be 
the scaled number of light variables assigned 1 by A with degree (i,j) in p. For each (i,j, k) G £', 
let fij t k be the scaled number of light variables assigned by A with extended degree (i,j, k) in 
p. We must have 

3 

hj + ^2 foj> k = di >^ V(i,j) G £. (11) 

fc=i 

On the other hand, we classify the copies of literals occurring in p into five different types 
depending on their sign in ip, their assignment by A and whether they belong or not to a 
blocking clause. Each copy receives a label from the set 

S = {ps,nsl,ns2,pu, nu}, 

where the labels ps, pu, nsl, ns2 and nu denote positive-satisfied, positive-unsatisfied, negative- 
satisfied inside a blocking clause, negative-satisfied inside a non-blocking clause and negative- 
unsatisfied, respectively. It is useful to consider as well coarser classifications of the copies of 
literals in ip and thus we define the types p, n and ns which correspond to positive, negative and 
negative-satisfied copies, respectively. Also, let 

S' = {ps,ns,pu,nu} and S" = {p,n}. 

For each of the types o G S U S' U S" that we defined, let £ a be the scaled number of copies of 
type a. Note that £ p and £ n were already defined (see (5) and (6)). Also, let h a be the scaled 
number of copies of type a which come from heavy variables (recall that these copies are always 
satisfied by definition regardless of their sign) . In view of 

H n 

and of (5) and (6), we observe that £ p , £ n , /i ps and /i ns are constants which do not depend on 
the particular choice of (p>,A). The parameters /i ns i and h nS 2 depend on the particular (cp, A) 
and satisfy 

frnsl + ^ns2 = ^ns- (12) 

The parameters £ ps , £ pu , £ ns ±, £ nS 2 and £ nu also depend on (ip,A) and can be expressed as 

£ps — / _, tH,j "T" "-ps; -t-pu — / ^Ji,j,kt ^-nsl — / KJi,j,k ~r "-nsl) 

C a a 

4s2 = ^2(j ~ k)h, 3 \k + ^ns2, 4u = ^jUj- (13) 

a c 



Finally, the clauses of (p are classified into 16 extended types (not to be mistaken with the four 
syntactic types defined immediately before (6)). Each type is represented by a 2 x 2 matrix from 
the set 



A 



f = (Ma) ns(a)\ ^ . y = + Q 



A clause is said to be of extended type a = [ . { ; { ] if for each a 6 <S' the clause 

\pu(aj nu(ajy 

contains <r(a) copies of literals of type a. Notice that all clauses of extended type a also contain 

the same number of copies of type a for all other a € S U S" and thus we can define a{a) to be 

this number. For each a £ A, let c a be the scaled number of clauses of extended type a (while 

cit, < k < 3 is the number of clauses of syntactic type k, i.e. with k positive literals). We have 

^2 c a = Cfc. (14) 

aeA 

p(a)=fc 

The parameters ^ ps , £ vu , £ ns i, £ nS 2 and £ nu can also be expressed in terms of the c a by 

L=Y1 a ( a ) c ^ VcJ G 5 - ( 15 ) 

We now consider the following equations: 

<-ps "T <-pu — tp, t ns i + *-ns2 ~r <- nu — t n V-'-^'J 

tps = / ^t,j + <lps) "nsl = / kjijk + /insl, tns2 = / (j — kjjijfi + AJ- n s2 (17 J 

£ £' £' 

V = ^ ps(a)c a , 4si = ^ nsl(a)c a , 4s2 = ^ ns2(a)c a (18) 

aG.4 aS.4 a£^ 

In view of (5) and (6), the system of equations {(11), (12), (13), (14), (15)} is equivalent to 
{(11), (12), (14), (16), (17), (18)}. 

So far we verified that the constraints {(11), (12), (14), (16), (17), (18)} express necessary 
conditions for the parameters of any particular ((/?, A), with (p € C ^~ an d A \= ip. Now we 
will see that they are also sufficient, in the sense that for each tuple of parameters satisfying the 
above-mentioned constraints we will be able to construct pairs (<p, A). 

Let t = (Uj)c, f = (fi,j,k)c, h = (K sl ,h ns2 ), c= {c a ) a eA, $= (4)aes and 

K=\C\ + \C'\ + 2 + \A\ + \S\ = (M + 1) 2 (1 + M/2) + 23. 

We define the bounded polytope V(d,c) C M K as the set of tuples x = (i, f,h,c,£) of non- 
negative reals satisfying {(11), (12), (14), (16), (17), (18)}, and consider the following set of 
lattice points in V(d,c): 



2(n,d,c) =V(d,c)n 



1. ^ K 



n 



For any tuple of parameters x € I(n, d,c), we proceed to count the number of pairs (p, A), with 
(p e C 2" an( i A N Vj satisfying these parameters. We denote this number by T(x,n,d,c). In 
order to do the counting, we first consider the number of ways of assigning to each variable a 
truth value and a degree (and an extended degree as well for light 0- variables). This gives 

\(ti,jri)c, {fi,j,kn)c, {di,jn) H 



Notice that the only constraints required at this stage for the parameters are these in (11). Then 
we have to choose for each light O-variable which k of the j negative satisfied copies contribute 
to blocking clauses, and also (taking into account (12)) which /i ns i of the h ns negative copies of 
heavy variables contribute to blocking clauses: 



n 



„ ^ }i,j,k n 

k 



h ns n 



hnsin, h ns2 n 



So far, we have assigned labels in S to all copies of literals in cp. This construction is compatible 
with (13). Now we have to decide which specific copies of each type a £ S will be used in the 
construction of clauses of each extended type a € A. The number of ways of doing that is 



n 

aes 



'a(a)c a n)aeA 



where it is sufficient that the c parameters satisfy (14) and that the £ parameters are expressible 
as in (15). Finally, the number of ways of constructing c a clauses of each extended type a £ A 

is I\a£A W ( a )i where 



W(a) 



(w(a)c a n)l(c a ny 2 - w( ^ 



(c a n)\ 2 
{2c a n)\ 



2c a n 

(3c a n)\ 



if w(a) = 1, 
if w(a) = 2, 



if w(a) = 3, 



(w(a)\) Can 

(c a n)\6 Can 
and w(a) is the number of O's in the matrix a. Putting everything together, we have that 

T(z,n,d,c) = 2£«W n . a 

\{ti,jn)c, Ui,j,kn)c, [dijn) n 



n 



Ji,j,k'^ 



hn S n 



, Y\ Y\ w( 

Ksin, h ns2 nj f- A \(a(a)c a n) a( z A J A-L 



<"> 



Hence 



EX 



1 



K 



n,d,c< 



J2 T(x,n,d,c). 



xSX(n,d,c) 

To characterize the asymptotic behaviour of T(x,n,d,c)/\C o~| with respect to n, we define 



F(x) 



n 



ctGcS^o" 



Ji.j.k 



and 



n c *.,/•' Ylc> /y,*/fi) ' ' i'„ l "~'K.2 k ~' rw «»(«)!/2)« 



B^^^j-m r c r^r^ 



c *-p 

Then we use the following form of Stirling's inequality which holds for any k € N: 



and we obtain 



^/2n{k + l/8)(k/e) k < k\ < yj2v{k + l/4)(fc/e) fc , 
T(x,n,d,c) 



K, 



< V o\ yi (n){B(d,c)F(x)) n , 



d,c< 



where poly 1 (n) is some fixed polynomial in n which can be chosen to be independent of x, d 
and c (as long as x € 2T(n, d, c) and (d,c) £ Af(n, S, 7, e)). Moreover, since the size of I(n, d,c) 
is also polynomial in n, we can write 

( ~~ V / ~~ V 

EX < poly 2 (n) B(d,c) max F(x) < poly 2 (n) B(d,c) max F(x) , 

y x&(n,d,c) J V x£V(n,d,c) J 

for some other fixed polynomial poly 2 (n). By continuity, if we choose e to be small enough, we 
can guarantee that 

EX < f (1 + W~ 7 )B max F(x) J , (19) 

y xeV(n,6,y) J 

where (recall the definition in (7)) 



H / C 

VVC°i,i 



x P ^x/ 



^HEA, ^- (20) 



w 



(37) 



3.2 Maximization of F(x) 



We wish to maximize F or equivalently logi 7 over the domain - P(n,<5,7). We first show the 
following lemma (for the proof see the appendix): 

Lemma 3.1. F(x) does not maximize on the boundary ofV(n,5,~f). 

Since log F does not maximize on the boundary of its domain, the maximum must be attained 
at a critical point of log F in the interior of V(n, d, 7). We use the Lagrange multipliers technique 
and characterize each critical point of log F in terms of the solution of a 3 x 3 system (the details 
are given in the appendix, Section E). The system is numerically solved with the help of Maple, 
which finds just one solution. We express the maximum of F over "P(n,<5,7) in terms of this 
solution, and multiply it by B given in (20), and from (19) we obtain the bound 

EX < ((1 + 1(T 7 )0.9999998965) n , (21) 

which concludes the proof of Lemma 2.5, since (1 + 1CP 7 )0. 9999998965 < 1. 

Note that the validity of our approach relies on the assumption that the solution of the 3x3 
system found by Maple is unique, which implies that the critical point of log F we found is indeed 
the global maximum (if an alternative solution exists it could happen that at the corresponding 
critical point the function F attains a value greater than the maximum obtained). 

In order to be more certain about the correctness of (21) we performed the following alter- 
native experiment: Let Vf be the polytope obtained by restricting V(n,5,~f) to the coordinates 
^ps,^pu5^nsi5^ns2 5^nu- Observe that this is a 3-dimensional polytope in M 5 , since its elements 
are determined by the values of the coordinates ^ps,^nsi,^ns2- We performed a sweep over this 
polytope by considering a grid of 100 equispaced points in each of the three dimensions. For 
each of the 100 3 fixed tuples of (^ ps ,^nsi)^ns2) which correspond to the points on the grid, we 
determine the remaining two coordinates of Vg, and maximize logi 7 restricted to those fixed 
values of I. Observe that in this case log F is strictly concave and thus has a unique maximum 
which can be efficiently found by any iterative Newton-like algorithm. We checked, again using 
Maple, that the value obtained for each fixed tuple of I is below the maximum in (21). 
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Appendix 



A Proof of Lemma 2.2 



In order to prove the lemma, we give a lower bound on the number of simple configurations and 
then divide it by the total number of configurations. The total number of configurations is 

(3cn)! 



6 cn (cn)! 



To obtain a lower bound on the number of simple configurations, we construct a subset of these. 
Consider the set of labelled copies of literals according to the degree sequence d. Among those 
copies which correspond to variables with one unique occurrence, we select any 2 \Qn 1 ' 3 ] (recall 
that in J\f(n,S,ei) there is a linear number of such variables). Call this set Uq. Let U be the 
set of all the remaining copies. We pick an arbitrary order of the copies in IA and put them 
into a list. At each step, we remove the first copy of the list, match it with two other suitable 
copies in the list (which are also removed) and repeat until the list contains exactly [^n 1 / 3 ] 
labelled copies of literals. Let x be the variable corresponding to the first labelled copy of the 
list at some step of the procedure. If we want to avoid repetitions of variables in clauses then, 
when choosing the two other copies to be matched with that first one, we have to exclude the 
at most 27J 1 ' 6 other copies of x (recall that in J\f(n, d, e\) the degrees of all variables are at most 
2n 1 ' 6 ). If we want to avoid multiple clauses, we have to exclude the at most Sn 1 ' 3 copies of the 
variables which are already in some clause together with some copy of x. Therefore, at most 
|"9 n i/3"| CO pi es a re excluded in total at each choice. This gives rise to at least 

(3 C n-3r9nV3])! 



6 cn -r9« 1/3 l (en- f9nV3])!' 

different ways of grouping the copies in U into clauses except for the [On 1 ' 3 ] copies which 
remain in the list at the end of the procedure. Finally, each one of the [9U 1 ' 3 ] remaining copies 
is matched arbitrarily with any two copies in Uq to create a clause, and this completes the 
construction of the configurations. The choice of U$ guarantees that all the configurations we 
obtained are simple. Hence, dividing (22) by the total number of configurations gives a lower 
bound of e -©(« 1/3 l°g"). 

B Proof of Lemma 2.3 

Given <p G C n ^ with d € J\f(n,d,€i), we apply the following pure literal elimination algorithm 
to ip: at each time step t > 0, one literal occurrence is chosen uniformly at random from all pure 
literal occurrences appearing in <p = ip(t) at time t (if the variable of the chosen literal occurrence 
is not yet assigned a value, it is set such that this literal occurrence is satisfied) and the clause 
containing this occurrence is eliminated. The algorithm stops when there is no more pure literal 
occurrence left. The resulting configuration is denoted by <p. It is easy to see that <p is unique and 
independent of the order of elimination of pure literal occurrences. Moreover, at any time step 
t, the algorithm retains the conditional randomness of the configuration, i.e., conditional under 
having any (fixed) degree sequence d at time t, any configuration (p obeying this degree sequence 
and whose image (j) = ir(<p) is a simple formula, is equally likely to appear at time t in this 
algorithm. To analyze the expected degree distribution of <p, we introduce the following variables: 
for any < i,j < n, Yij(t) denotes the random variable counting the number of variables with 
i positive and j negative occurrences at time t, and Yo(t) denotes the random variable counting 
the number of pure literal occurrences at time t, i.e., Yo(t) = J2i>o^Yifl(t) + ^2- >0 jYoj(t). Also 
define by L(t) the random variable counting the total number of literal occurrences at time t. 



Clearly, L(t + 1) - L(t) = -3. Note also that Y (t) = L(t) - ^ iJ>0 (i + j)Y id (t). Defining 
by G(t) the sequence of Yo(t), Yi t i(i), . . . , Y n>n (t), we have that for any t, for any i,j>0 and 
conditional upon any values of G{t) 

E(Y itj (t + 1) - Yij(t) \ G(t)) = |((i + l)Y i+ld (t) + (i + l)y iJ+ i(t) - (i + i)y M (t))(l + o(l)), 

E(y (t + 1) - y (t) | G(t)) = I |(E^i( f ) + E^ f ) " y oW) - ! ) C 1 + (!))- 

\ i>0 j>0 J 

If we now interpolate the variables L(t) and Yij(t), by defining the scaled versions yij(t) = 
^Yij(tn) for i,j > 0, £{t) = ^L(tn) and yo(^) = l;Yo(tn), (to understand the limiting behavior 
as n tends to infinity) this suggests the following system of differential equations: 

di 



-3 

dt 



dy%j _ 2 



((i + l)y i+ ij + (i + l)yi,j+i ~{i + j)yi,j), i,j > 0, 



dt 

i>0 j>0 

with the initial conditions £(0) = 37, yo(0) = Si>o^ d(i,0) + ^ 3>0 i d(0,j), and for i,j > 
we have yij(0) = d(i,j), where d(i,j) is the scaled number of variables of ip appearing i times 
positively and j times negatively. It can be easily seen that £(t) = 37 — 3t. Furthermore, defining 
the auxiliary function b(t) = (1 — 1/7) 2 ' 3 , it can be checked, that a solution (its uniqueness will 
be proven below) for the system of differential equations involving i/ij is the following function: 

ViJ® = EE <*(*'*)(*) ( £ )(Ht)) l+3 (l - b(t)) k -\l - bit))''*- 
k>n>j W U/ 

Also, it follows then that 

y (t) = 3 7 - 3t - E(* + J)^"(*)- 

j,i>0 

We are interested in the smallest value of t for which yo(t/n) = 0. This value of t will be the 
stopping time Tp of our process (for our particular system of differential equations, starting 
with (p € C nt d with d € J\f(n,S,ei), we empirically observed that Tx>/n is between 0.15 and 
0.16, and hence the resulting configuration Ip a.a.s. still contains many literal occurrences. This 
observation is used to simplify the statement of Wor maid's theorem). 

To prove the uniqueness of the solution and the concentration of the resulting distribution, 
we use the following version of Wormald's theorem tailored to our specific system of ODE's: 

Theorem B.l (Wormald [18]). Suppose that for any value of G(t) and any Yij(t) (throughout 
the statement of the theorem y,j(t) also includes the case ofYo(t)) we have 



E[Y hj (t + 1) - Y hj (t) I G(t)) = fij(t/n, {Y hl (t)/n} ltj ) + 0(1/ 
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where the fcj : R" -> 1 (o being equal to the number of variables Yjj plus 1 for the variable 
t) are continuous functions with all Lipschitz constants uniformly bounded. Suppose also that 
\Yij(t + 1) — Yij(t)\ < [3, and suppose that there is an order of the functions fyj such that any 
fij depends on at most the terms preceding it in this order (including itself). Then, for some 
large enough constant C and < a < 1/3, the system of differential equations 

dyi,j{x)/dx = fij(x, {yij(x)}) 



with the initial conditions yij(0) = Yij(0)/n has a unique solution and moreover, for any t < 
Tx> — Cn x ~~ a , we have 

Y lJ (t)=ny lJ (t/n) + 0(n 1 - a ) 

with probability at least 1 — 0(n 2+a e~ n ). 

Defining the /ij's to be the right hand sides of the system of differential equations of the 
yij, i.e., 

, ( s _ 2 {-{i + J)yi,j + (* + l )vi+i,j + (i + l )vi,j+i) 

Ji,j\ x i Vi,ji Vi,j+li Vi+l,j) — 



37 — 3x 

and 

2 

fo(x,y ,yi ! i,y 2 ,i, ■ ■ ■ ,y n ,i,yi,2, ■ ■ ■ ,yi, n ) = 3 _ 3 (-2/0 + E^ 1 + ^2jVi,j) ~ h 

it is easily seen that these functions are continuous and satisfy a uniformly bounded Lipschitz 
constant. Next, it is a routine calculation to show that for any 1 < i,j < n 

|E(y M -(t + 1) - Y iyj (t) I G(t)) - f itj {t/n,Y itj (t/n),Y ijj+1 {t/n),Y i+1J (t/n)) \ = 0(l/n) 

and 

|E(y (t + 1) - Y (t) I G(t)) - / (t/n, Y (t/n), ..., Y hn (t/n)) \ = 0(l/n). 

Also, we have 

max \Y i:j (t+l)-Y i:j (t)\<2 and \Y (t + 1) - Y (t)\ < 3. 

l<i,j<n 

Moreover, we can rearrange the /ij's in descending order of the sum of i and j (being fo 
the last element of this order), and hence we can apply Wormald's theorem. That is, for any 
t <Tx> — Cn l ~ a , we have for any i,j > 

Y i j(t)=ny i j(t/n)+0(n 1 - a ) 

and 

Y (t)=ny (t/n)+O(n 1 - a ) 

with probability at least 1 — 0(n 2+a e~ n ). Since in each step lo as we U as Yi,j change by 
at most 3, the change of Yij and Yq caused by the eliminations of pure literal occurrences after 
time Tx> — Cn l ~ a is at most (^(n 1- "), and thus the concentration results for Yij(t) and Yo(t) 
hold throughout the whole algorithm. 

Now, since originally we start with <p € C nt d with d £ J\f(n,6,ei), we have \d(i,j) — 

-pi — I — e i f° r an y hj *= N. Since the functions /j 7 - and /o appearing in the system 

of ODE's are continuous and their Lipschitz constants are uniformly bounded, by the continu- 
ous dependence theorem (see e.g., [17], Theorem A. 68) the solutions of the system of ODE's are 
continuous with respect to the initial conditions, uniformly for all n. That is, there exists an 
e' 2 > 0, such that the degree sequence d of ip satisfies \d(i,j) — 5(i,j)\ < e' 2 with probability at 
least 1 — 0(n 2+a e~ n "), where (using the notation of b and 5 from above) 

*(M) = EE *<*'*)(*) ( £ )(HTv/n)) i+ i(l - b(T v /n)f~\l - b{T v /n)Y-i. 

k>i e>j \ %// ^^ 

When plugging in the initial condition 8(k,£) = e~ 37 ^ 1 k \g\ — , the expression for 8(i,j) can be 
simplified and we obtain 



s (i , ])=e ->*m$^ 



1!;! 



where b = b(t) = (1 — t/7) 2 ' 3 . Setting a = 1/6 and unbalancing <p (i.e., changing the syntactic 
sign of a variable appearing more often negatively than positively) we obtain a configuration (p 
whose degree sequence d satisfies for all i,j £ N 

\d(i,j) ~S(i,j)\ < e 2 

with probability at least 1 — 0(e _v ^), where e 2 = 2e' 2 and S(i,j) is defined as follows: 

( 26(i,j), if i> j, 

S(i,j) = I S(i,j), if i = j, 

{ 0, if i < j. 

Setting (3 = 1/e concludes the proof of the lemma. 

C Proof of Lemma 2.4 

Let us fix k € {0, 1,2,3}. We will show that for a small enough e 2 > the probability that 
|cfc — 7fe| > e is 0((3 n ) with < (3 < 1. Given a random y> € C n)C f, we choose an ordering of the 
en clauses uniformly at random. For each t = 1, . . . ,cn, define the random variable Yj to be 1 if 
the f th clause has syntactic type k and otherwise. These random variables are all identically 
distributed, and we have that 

Pr(y . = 1)= ( e t)(t-k) = (3\ [V] fc [4n] 3 -fc i <t <cn 

Let S 1 = X/t=i ^* an( ^ consider the following Doob martingale 

S i = E(S\Y 1 ,...,Y i ), \/i,0<i<cn. 
We have S^ n = S and also S = c k n. Moreover, 

in = E»S = I -£ : — ; — en ~ I — f rr^cn. 



kj [£ p n + £ n n} 3 \kj (£ p + Q 

Since d e A/*(n, <5,e 2 ), by choosing e 2 small enough, we can guarantee that c is arbitrarily close 
to 7 and thus 

\S /n- lk \<e/2. (23) 

Moreover, by a case analysis depending on the syntactic type of the (i + l)'st clause (and 
depending on the value of k) it can be shown that |Si+i — Si\ < 2 for any i E {0, . . . ,cn}. Thus, 
by Azuma's inequality, 

Pr(|Sa,-S |>i*)<e-^. 

Setting R = en/2, and by (23), we get 

Pr(|c fe - 7fc| > e) = Pr(\S dn - lk n\ > en) < Pr(|5 ?n - S Q \ > en/2) < e~ e ^ = G(/T), 
for some < (3 < 1. Taking a union bound over all k completes the proof of the lemma. 



D Proof of Lemma 3.1 

We show the equivalent statement that log(i ? ) does not maximize on the boundary of V = 
V(n, 5,7). Let x be a fixed point on the boundary of V. Since the boundary of V consists 
only of non-negativity constraints of some parameters, this means that some values of £ a , tij, 
fi,j,ki c a, hnsi or ^ns2 of x have to be zero. It can be shown that the only £ a which can be zero 
is £ nS 2'- for all other £ a one can consider the linear program which contains all constraints and 
as objective function the minimization of the particular l a . It can be shown that the value of 
the objective function is in all cases except for £ nS 2 strictly greater than zero (using CPLEX). 
In the following we call all c Q for which ns2(a) > 0, all fajk with M > j > k > 1 as well as 
^■ns2 (cf. (17) and (18)) to be forced. We now make a case analysis depending on which of the 
remaining parameters is zero. 
Case 1: None of the £ a is equal to zero. 

In this case we know that at least one of the tij, fij,k, c a , /i ns i or h nS 2 is equal to zero. First 
observe that the interior of V contains at least one feasible point x~q: since all constraints are 
linear, one can add additional linear constraints to ensure that all values of £ a , tij, fij t k, c a, h ns ± 
and h nS 2 are strictly positive and then check the feasibility of the constraints (we did this using 
CPLEX). Since V is convex, we can find a direction that points towards xq. In this direction all 
the Uj, fij t ki Ca, Kisi and /i nS 2 components which were zero must increase (perhaps at differ- 
ent rates), while the non-zero variables can either increase or decrease (and remain non-zero). 
Hence, the directional derivative in this direction contains at least one term which is — log(0) 
(i.e. +oo) (plus/minus some constants) and hence it is an increasing direction. 
Case 2: £ nS 2 = 0, and in addition to the forced values of c Q , fij,k and h nS 2 at least one 
of the other values of c a , tij, fij t k or ^nsi is zero as well. 

Using a similar argument as in the previous case, we can find a feasible point X\ for which 
4is2 = 0, all forced parameters are zero, but no other parameters are zero (we did this using 
CPLEX). Again, by convexity of V , we can find a direction that points towards x\. In this 
direction £ nS 2 and all the forced parameters remain zero, but all other c a , tij, fij t k an d /insi 
which were zero before increase (there is at least one of them). Hence, the directional derivative 
in this direction contains at least one term which is — log(0) (plus/minus some constants), and 
hence it is an increasing direction. 

Case 3: £ nS 2 = 0, all the forced c a , fij t k and h nS 2 are zero, but nothing else is zero. 
In this case we move infinitesimally in the following direction: we increase £ nS 2 and decrease £ ns ± 

by the same infinitesimally small amount =: £, we increase c a for a £ 



1 I) ' \0 2J \1 

by C, we decrease c a for a € < ( ) , ( ) , I ) > by (, and we also increase /2,2,i an d 

ft-ns2 by C/2 and decrease /2,2,2 and /tnsi by (/2 (note that since all values of fij,k, c a and /i ns i 
which are not forced are strictly positive, all the changes are allowed). It can be checked that 
when performing all these changes, all constraints remain valid, and by convexity of V , we can 

move into this direction. In this direction the variables £ nS 2j c a with a = I I, /2,2,i an d 

hns2 which were zero before move away from zero, all other zero variables remain zero, and 
all other non-zero variables remain non-zero. Hence, the directional derivative in this direction 
contains one term log(0) and three terms — log(0) (plus/minus some constants), and hence it is 
an increasing direction. 



E Critical points of log F 

In order to characterize the critical points of log F and the value of the maximum of F, we need 
some definitions. Let us consider the functions 

*i,j = ^ps* + (fnsl + fns2) i ~ fns2 J V(i, j) G £, 

**= E ^T7^ps PS(Q Vnsi nSl(a Vns2 nS2(Q) Vfc € {0, . . . , 3}, 

p(a)=fc 
^ = fnsl + fns2, *p = ^psPps + 1, *n = ^nslAW + ^n S 2^ns2 + 1, 

defined on tuples {u, p) = (i/ ps , is ns i, f nS 2i P P s ; Mnsi, Pns2) of positive reals, and let 

L = V ps h ? s I Yl ^i/ 1 ' 3 ) ^ hnS ( ]J *k™ ) *p _Ap *n _An 

Lemma E.l. There is a bijective correspondence between critical points of log F in the interior 
ofV(n,8, : y) and positive solutions of the 6x6 system of equations 

V P)A (logL) = 0. (24) 

Moreover, the value of F at each of these critical points can be expressed in terms of the corre- 
sponding solution by 



' %/ \*n/ c \0ij / \'<US/ k=Q 



ps l * P 7 VW \ L \5u) VW MAt* 



Proof. We use the Lagrange multipliers technique to characterize the critical points of log F in 
the interior of V(n, 5, 7). Let (p'^Ac, p', (/4)fce{o,...,3} an d (pL, p' n ) be the Lagrange multipliers 
of the equations in (11), (12), (14) and (16) respectively. These equations are called partition 
constraints. Moreover, let (fp S , f^si) ^2) an d (a4s> a4si> A 4 ^) be the Lagrange multipliers of 
the equations in (17) and (18) respectively. After a few manipulations, the standard Lagrange 
multiplier equations can be written as 

U,j = Pi,jV v s l V(i, j) G C, f itjjk = ( ]Pi,jVn S l k Vns2 J ~ k V(i, j, k) G C, 

hnsl = P^nsl, h ns 2 = pVns2, 

9 

r — n f mi P s ( a )// H nsl ( a )„ ns2(a) u _ a 

w(a)l V ; 

^•ps — Pp^psPpS) <-pu — Ppj <-nsl — Pn^nslPnsl 

^ns2 = Pn^i^Pi^, 4u = Pin (26) 

where 

p lJ = e^- 1 V(i,j)e£, p = e p ~\ 

p k = e p '^ 1 VfcG{0,...,3}, P(7 = e~ p "- 1 VaG{p,n}, 

v„ = e Ua Vo" G {ps,nsl,ns2}, p a = e Pa Vcr G {ps,nsl,ns2}. 

Notice that the new variables defined above must be strictly positive. By combining (26) with the 
partition constraints (11), (12), (14) and (16), the variables (pij)c, P, (Pk)ke{o,...,3\ an d (p P ,Pn) 



are eliminated and (t, f, h, c, I) can be expressed just in terms of v a and \x a (a £ {ps, nsl, ns2}): 

,, i (j \ , . k , . j — k 

, _ , ^nsl , _ , ^ns2 

"-nsl — 'ins j j "ns2 — <lns -y j 

2 ps(a),, nsl(a),, ns2(a) 

StfSJT^P" '^1 ^s2 ; 

Ca = 7p(a) ^ V « <= A 

*p(a) 

« _ \ ^ps^ps /, _ , _J_ /, _ ^ ^nsl^-nsl 

*-ps — ^p _ j £pu — A p , t ns i — /\ n , 

^p ^p ^n 

4is2 = A n — , £ nu = A n -j— (27) 

^*=n ^n 

Then we plug these expressions into the constraints (17) and (18) and after a few manipulations 
we obtain the following 6x6 system on variables v a , [i a 



^ P s I Ap — I = ^ P s I 2^ Si >i — y~. — I + V> 

) =M PS J^7fc 
' \fe=o 



^nsi ( A n — J = z/ nsl | ^ <*m "j— + ^ns ^ I , (28a) 

'ns2 I A n — j - U ns2 ^ 2^ d »J vf - h ftns ^ I ' 



/V A 



■p— ^— )=<* X>— ST" 



.3 



9*n/^ nsl \ /^ a^/s^ 



AW ( A n I = aw 2^ 7fc -r , (28b) 

n ' \k=0 k J 






\k=0 

which can rewritten as in (24). □ 

By restricting our attention to (28b), we observe that each of the variables f ps , zv nsl and f nS 2 
appears only in one of the three equations, and also that we can easily express f ps , f ns i and v nS 2 
in terms of /i ps , /i ns i and /i nS 2- By plugging this expression into (28a), we obtain a 3 x 3 system 
in terms of the variables /z ps , // n si and /x nS 2 (recall that only positive solutions of the \x a which 
lead to positive v a are considered). We numerically solved this 3x3 system with the help of 
Maple, using 50 digits of precision and M = 23. The unique solution obtained by Maple is 

//p 8 « 1.9972796, fi nsl « 0.45029358, /x ns2 « 0.33794030, 

i/ p8 « 1.2782018, z/ nsl « 0.33277280, i/ ns2 ~ 0.95336927. 

In view of (19) and Lemma E.l, we evaluate (25) at these values and multiply this by B given 
in (20), and we obtain the bound 

EX < ((1 + 10" 7 )0.9999998965) n . (29) 



